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Entanglement, according to Erwin Schrodinger the essence of 
quantum mechanics, is at the heart of the Einstein-Podolsky-Rosen 
paradox and of the so called quantum-nonlocality - the fact that 
a local realistic explanation of quantum mechanics is not possi- 
ble as quantitatively expressed by violation of Bell's inequalities. 
Even as entanglement gains increasing importance in most quan- 
tum information processing protocols, its conceptual foundation 
is still widely debated. Among the open questions are: What is 
the conceptual meaning of quantum entanglement? What are the 
most general constraints imposed by local realism? Which general 
quantum states violate these constraints? Developing Schrodinger's 
ideas in an information-theoretic context we suggest that a natural 
understanding of quantum entanglement results when one accepts 
(1) that the amount of information per elementary system is finite 
and (2) that the information in a composite system resides more 
in the correlations than in properties of individuals. The quanti- 
tative formulation of these ideas leads to a rather natural criterion 
of quantum entanglement. Independently, extending Bell's original 
ideas, we obtain a single general Bell inequality that summarizes 
all possible constraints imposed by local realism on the correlations 
for a multi-particle system. Violation of the general Bell inequality 
results in an independent general criterion for quantum entangle- 
ment. Most importantly, the two criteria agree in essence, though 
the two approaches are conceptually very different. This concur- 
rence strongly supports the information-theoretic interpretation of 
quantum entanglement and of quantum physics in general. 
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I. INTRODUCTION 

In their seminal paper in 1935 Einstein, Podolsky and 
Rosen (EPR) consider quantum systems consisting of two 
particles such that, while neither position nor momentum 
of either particle is well defined, both the sum of their 
positions and the difference of their momenta are both 
precisely defined. It then follows that measurement on 
either position or momentum performed on, say, particle 
1 immediately implies for particle 2 a precise position 
or momentum respectively even when the two particles 



are separated by arbitrary distances without any actual 
interaction between them. 

Motivated by EPR Schrodinger (1935a) in his paper 
entitled "The present situation in quantum mechanics" 
wrote succinctly "Maximal knowledge of a total system 
does not necessarily include total knowledge of all its 
parts, not even when these are fully separated from each 
other and at the moment are not influencing each other 
at all" and he coined the term 1 "entanglement of our 
knowledge" to describe this situation. A central goal of 
our work is to take Schrodinger's position as a starting 
point for a quantitative information-theoretic definition 
of entanglement. Therefore, carefully reading his sen- 
tence one may identify three independent ideas on which 
Schrodinger builds his notion of entanglement, (a) First, 
our knowledge, or total information, of a system is bound, 
(b) Second, the total information of a composite system 
is not necessarily fully contained in its individual con- 
stituents, (c) And the third, these statements are inde- 
pendent of the relative space-time arrangements of the in- 
dividual observations on the constituents of the compos- 
ite system. Furthermore we underline that Schrodinger 
talks about quantum states representing our expectation 
catalogs and he carefully avoids the notion of properties 
of systems. 

In 1964 John Bell obtained certain bounds (Bell in- 
equalities) on combinations of statistical correlations for 
measurements on two-particle systems if these correla- 
tions are understood within a realistic picture based on 
local hidden properties of each individual particle. In a 
realistic picture the measurement results are determined 
by properties the particles carry prior to and independent 
of observation. In a local picture the results obtained 
at one location are independent of any measurements or 
actions performed at space-like separation. Then Bell 
showed that quantum mechanics predicts violation of 
these constrains for certain statistical predictions for two- 
particle systems. A more striking conflict between quan- 
tum mechanical and local realistic predictions even for 
perfect correlations has been discovered for three and 
more particles (Greenberger et al, 1989; 1990; Mermin, 
1990), resulting in an outright invalidation of the EPR 



1 Schrodinger (1935a) used the German original "Ver- 
schrankung" and he himself introduced the English transla- 
tion "entanglement" (Schrodinger, 1935b). 



1 



concepts. By now a number of experiments (Freedman 
and Clauser, 1972; Aspect et al, 1981; Pan et al, 2000) 
have confirmed the quantum mechanical predictions even 
if the individual particles are truly space-like separated 
(Wcihs et al, 1998). However two important questions 
remain: (1) "What are the most general constraints on 
correlations imposed by local realism?" and (2) "Which 
quantum states violate these constraints?". The latter 
has been solved in general only in the case of two par- 
ticles in pure states (Gisin, 1991; Gisin and Peres 1992) 
and for two-qubit mixed states (Horodecki et al., 1995). 
Only recently bounds for local realistic description of 
higher-dimensional systems have been found in some sim- 
ple cases (Kaszlikowski et al, 2000). 

Our paper will now develop the ideas presented 
above in various quantitative ways. Following first 
Schrodinger's ideas we will define quantum entanglement 
as a feature of a composite system to have more infor- 
mation contained in correlations than any classical mix- 
ture of its individual constituents could ever have. The 
essence of classical correlations is that there the joint 
properties can be reduced to correlations between proper- 
ties of the individual constituents. This is an operational 
definition because information is always defined through 
observation of measurement results. 

In parallel, following then Bell's ideas, we will obtain a 
single general Bell inequality that summarizes all possible 
local realistic constraints on the correlations for a multi- 
qubit system, where two dichotomic observables are mea- 
sured on each individual qubit. This enables us to intro- 
duce another operational definition of entangled states as 
those which violate that general Bell inequality in a direct 
measurement 2 . Finally we show that the two operational 
definitions are equivalent in the two-qubit case and that 
in general our informational definition of entanglement 
provides a necessary and rather stringent condition for 
violating the general Bell inequality. We find this intrigu- 
ing because the two operational approaches are based on 
completely different concepts. To us this further sup- 
ports the view that information is the most fundamental 
concept in quantum physics (Zeilingcr, 1999). 



2 By an entangled state is often meant a state which cannot 
be represented as a classical mixture of product states (non- 
separable state). Here, since our aim is to relate Schrodinger's 
notion of entanglement to the one via Bell's inequality we 
will use the definition as given in the text above. However, 
it is well known that there are cases of " hidden non-locality" 
where a quantum state initially does not directly violate a Bell 
inequality but after local operations together with classical 
communication such a violation might occur (Popescu, 1995). 



II. FINITENESS OF INFORMATION 

A central point in our discussion will be the differ- 
ent ways how information can be distributed within a 
composite system. We therefore have to introduce our 
notion of information and we have to give our consider- 
ations about how much information altogether a system 
can represent. 

Any physical description of a physical system is a set 
of propositions together with their truth values - true 
or false. Then, any proposition we might assign to 
a quantum system which is always based on observa- 
tion of properties of the classical apparatus used, repre- 
sents our knowledge, i.e., information, of a system gained 
through observation. To illustrate this consider the state 
\ip) = \z+) of a spin-1/2 particle with spin up along the 
x-axis, which is an eigenstate of the operator a z with 
eigenvalue +1. This simply means that the quantum 
system described by the state will be found with cer- 
tainty to have spin +1 if it is measured along the z-axis. 
Thus the information content in that state is represented 
by the truth value of the proposition: "The spin along 
the z-axis is up." This is one bit of information. It is 
clear that both truth values must be possible. Only then 
observation of the system can result in a gain of infor- 
mation. In agreement with Schrodinger's idea (a) above 
the most simple system then just represents one bit of 
information. This is what we mean when we talk about 
a system carrying information. To us a system is a con- 
struct based on information. 

If we assume that one bit is the only information the 
most simple quantum system can carry, and that this 
is defined with respect to a certain measurement then 
other measurements must contain an element of irre- 
ducible randomness. Otherwise the system would carry 
more information in conflict with (a) for a system rep- 
resenting one bit of information only. This means that 
there are other measurement directions for which the ex- 
perimental outcome is completely random. Specifically, 
for a measurement along any direction n in the x-y plane 
the proposition " The spin along the n axis is up" is com- 
pletely indefinite, i.e., we have absolutely no knowledge 
which particular outcome "spin up" or "spin down" will 
be observed in an individual experimental trial. The two 
propositions about spin along z and about spin along n 
are propositions with a property of mutual exclusiveness. 
This is quantum complementarity: the complete knowl- 
edge of the truth value of one of the propositions implies 
maximal uncertainty about the truth values of the other. 

How much information is carried by a system with re- 
spect to a specific set of mutually complementary propo- 
sitions? We suggest that it is natural to assume that the 
information contained in a set of mutually complemen- 
tary propositions is the sum over the measures of infor- 
mation of the individual members of that particular set 
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(Bohr, 1958). Specifically, to obtain the total informa- 
tion carried by a quantum system one summarizes over 
all individual measures of information for a complete set 
of mutually complementary measurements, as shown in 
Brukncr and Zeilinger (1999). There it was shown that 
the total information carried by the composite system 
consisting of N qubits in a pure state to be N bits of 
information. 

Here we are interested in the various ways how infor- 
mation can be distributed within a composite system. In 
particular we will consider that part of the total infor- 
mation of the system which is contained in correlations, 
or joint properties of its constituents. This is also the 
reason why here we do not consider complete sets of mu- 
tually complementary propositions for the composite sys- 
tem but just a subset of them concerning joint properties 
of its constituents. 

III. INFORMATION CONTAINED IN CORRELATIONS 

Correlations between quantum systems have assumed 
a very central role in the discussions of the foundations of 
quantum mechanics. We will now investigate how much 
information can be contained in such correlations in order 
to give an information-theoretic criterion of quantum en- 
tanglement. As it is our final goal to compare that crite- 
rion with the one given by Bell-type inequalities (Clauscr 
et al, 1969; Mermin, 1990; Belinskii and Klyshko, 1993) 
where one considers correlations between spin measure- 
ments confined on each side within one plane we restrict 
our analysis to an x-y plane locally defined for each sub- 
system. 

For clarity of presentation we first investigate the case 
of a two-qubit system carrying therefore N=2 bits of in- 
formation, i.e. representing the truth value of two propo- 
sitions. That information contained in 2 propositions can 
be distributed over the 2 qubits in various ways. 

Consider first a product state e.g. \ip) = | + x)i| — x} 2 - 
Here the state \tp) represents the two-bit combination 
true-false of the truth values of the propositions about 
the spin of each particle along the x-axis: (1) "The spin 
of particle 1 is up along x" and (2) "The spin of parti- 
cle 2 is up along x". Instead of the second proposition 
describing the spin of particle 2, we could alternatively 
choose a proposition which describes the result of a joint 
observation: (3) "The two spins are the same along x." 
Then the state represents the two-bit combination 
true- false of the truth values of the propositions (1) and 
(3). Note that this is also present in classical composite 
systems. 

Evidently, for pure product states at most one propo- 
sition with definite truth-value can be made about joint 
properties because one proposition has to be used up to 
define a property of one of the two subsystems. In other 
words 1 bit of information defines the correlations 
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In our example where \ip) — | + x)i| — x) 2 the correla- 
tions are fully represented by the correlations between 
x-measurements on both sides, therefore I^rf = 1 = 
I xx = 1. The specific measure of information used by us 
will be specified below. 

Obviously, the choice of directions within the planes 
of measurement on the two sides is arbitrary. Choos- 
ing general x and y directions we request that the total 
information contained in the correlations must be invari- 
ant upon this choice. This invariance property by itself 
already defines the specific measure of information and 
it rules out Shannon's measure (Brukncr and Zeilinger, 
2001). Following our arguments given above we now de- 
fine the information contained in the correlations as the 
sum over the individual measures of information carried 
in a complete set of mutually complementary observa- 
tions within x-y plane. Therefore the information con- 
tained in the correlations is quantified by the sum 

Icorr Ixx ~l~ ^xy ~l~ lyx ~l~ lyy (2) 

of the partial measures of information contained in the 
set of complementary observations within the x-y-planc. 
These observations are mutually complementary for prod- 
uct states and the set is complete as there exists no 
further complementary observation within the chosen 
planes. By this we mean that for any product state a 
complete knowledge contained in one of the observations 
in Eq. (2) excludes any knowledge content in the other 
three observations. 

Consider now a maximally entangled Bell state, e.g. 

\<t>~) = -^=(i + x) 1 | -x) 2 + i-x)i|+x) 2 ) 

= ±=(] + y) 1 \ + y) 2 + \-y) 1 \-y) 2 ). (3) 

The two propositions here both are statements about re- 
sults of joint observations (Zeilinger 1997), namely (1') 
"The two spins are equal along x" and (2') "The two spins 
are equal along y" . Now the state represents the two-bit 
combination false-true of these propositions about cor- 
relations. Note that here the 2 bits of information are 
all carried by the 2 qubits in a joint way, with no in- 
dividual qubit carrying any information on its own. In 
other words, as the two available bits of information are 
already exhausted in defining joint properties, no further 
possibility exists to also encode information in individu- 
als. Therefore 

jBell _ q (a\ 
*corr ^' V^ 1 / 

Note that in our example I xx = I yy = 1 and I xy — I yx = 
0. Also, note that the truth value for another proposition, 
namely, "The two spins are equal along z" must follow 



3 



immediately from the truth values of the propositions (1') 
and (2'), as only 2 bits of information are available. Inter- 
estingly this is also a direct consequence of the formalism 
of quantum mechanics as the joint eigenstate of o~\o~1. and 
of OyOy is also the eigenstate of a\a\ = — (o^afXa^o^). 

In contrast to product states we suggest entanglement 
of two qubits to be defined in general such that more than 
one bit (of the two available ones) is used to define joint 
properties, i.e. 

I%» ld > 1 (5) 

for at least one choice of the local x and y directions for 
the two qubits. This is in agreement with Schrodinger's 
idea (b) . Equivalcntly we suggest to define the two qubits 
as classically composed (non-entangled) if less than or 
equal to one bit of information is used to define correla- 
tions, i.e. 

1 corr — 1 \°/ 

for all possible choices of the local x and y directions 
for two qubits. As we will show below, the independent 
information-theoretic definition of entanglement (5) will 
turn out to be equivalent to a necessary and sufficient 
condition for a violation of any Bell-type inequality for 
two-qubits. 

In the generalization to more and more qubits we con- 
sider, without loss of generality, a product state \tjj) = 
\+xi)\+x 2 )---\+xn) of N qubits. Here, Xj denotes a spa- 
tial direction in a local coordinate system of observer j. 
Then only one proposition with definite truth-value can 
be made about the correlations in the N qubits, namely 
the proposition (*): "The product of spin of particle 1 
along xi, spin of particle 2 along x 2 , ... and spin of par- 
ticle N along a; 7v is +1". This means that for a product 
state or a classical mixture of product states again at 
most one bit is represented in N-qubit correlations. We 
therefore here too suggest to define N qubits as classi- 
cally composed if not more than one bit of information 
is used to define correlations, i.e. 

j-nonent \ j ^ -i (rj\ 

1 corr ~ / / - l x 1 ...x N 2= 1 {• ) 

X!,...,»jy 

=x,y 

for all possible choices of local x and y directions for N 
qubits. Here the sum is over the measures of informa- 
tion over a set of propositions of the type given above 
where x\, ...,Xn & {x,y} and which therefore are mutu- 
ally complementary for product states 3 . 



3 Since the set of propositions entering Eq. (7) contains 
also propositions which are not mutually complementary for 
a non-product state the sum in (7) does not give the value of 



We would like to make a very general comment. The 
quantitative condition (7) while certainly correct for the 
situations discussed here might have to be modified in 
order to apply to more complicated cases like entangle- 
ment between many qubits when the measurements are 
not restricted to one plane or entanglement between sys- 
tems defined in Hilbert spaces of higher dimensions, so- 
called qunits. Another interesting case can arise when 
one also considers in detail all possible sets of correla- 
tions between all possible sets of subsystems. For ex- 
ample for 3-qubit systems we have one correlation be- 
tween all three individual qubits, we have 3 correlations 
between two individual qubits and we have another 3 
correlations between one qubit and the other two. This 
results in a large number of conditions of the type of Eq. 
(7) which are not independent from each other in general. 
Yet, we stress, that it is to be expected that the general 
ideas laid out here will still be applicable. A most impor- 
tant guidance for quantitative conditions being that the 
information carried by the correlations between subsys- 
tems exceeds the limit given by the information carried 
by the subsystems themselves. 

Finally we stress that in our information-theoretic 
analysis of entanglement we did not have to use concepts 
like spatial separation between subsystems of the com- 
posite system or the relative times of the observations on 
the subsystems. This is in agreement with Schrodinger's 
idea (c). 

To this point we did not yet specify any particular 
measure of information. Thus the question arises which 
particular measure of information is adequate to define 
the information gain in an individual quantum experi- 
ment. 

IV. QUANTIFYING INFORMATION 

Consider an experiment with two outcomes "yes" and 
" no" and with the probabilities p\ and p 2 = 1 — pi , re- 
spectively for the two outcomes. Within finite time the 
experimenter can perform only a finite number of experi- 
mental trials. Because of inherent fluctuations associated 



information in bits contained in correlations for such states. 
It is then not surprising that, for example, in the case of a 
three-qubit GHZ state the sum results in 4 (I X yy — lyxy — 
Iyyx = Ixxx = 1, other are zero). Here however only three of 
the four propositions are independent, as only 3 bits of infor- 
mation are available. This again is also a direct consequence 
of the formalism of quantum mechanics as the joint eigen- 
state of ala-ydy, a\ j a 2 x a'y and (Jya 2 <j x is also an eigenstate of 
o'xO'xO'x = — (o'x^y^y)( a y a x f7 y)( a y f7 y f7 x)- Nevertheless when 
the sum in (7) is larger than unity, this indicates that the 
total information contained in correlations is larger than one 
bit. 
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with any probabilistic experiment with a finite number 
of trials the number of occurrences of a specific outcome 
in future repetitions of the experiment is not precisely 
predictable. Rather it obeys the binomial distribution 
(See e.g. Gnedenko, 1976). 

If one bets for example that the number of " yes" out- 
comes will be the one with highest probability the prob- 
ability of success still depends on p\ . With a probability 
of pi — 0.5, the probability of 5 "yes" outcomes in 10 
trials is only 0.25, but with pi = 0.9 the probability of 
9 "yes" outcomes in 10 trials is 0.39. It is a trait of 
the binomial distribution, that we know the future num- 
ber of occurrences of the outcomes very well if p\ (or 
equivalently p-i) is close to or 1, but we know much 
less about them when p\ is around 0.5. Note that this 
follows from elementary probability theory without any 
input from physics . In (Brukner and Zeilinger 1999) it 
was shown that this knowledge is properly represented 
by the measure 

I=(pi-P2?. (8) 

This attains its maximal value of unity when one of prob- 
abilities is one, and it attains its minimal value of when 
both probabilities are equal. 

Note that all our propositions about joint properties 
are binary propositions, i.e. they are associated to ex- 
periments with two possible outcomes, one of them being 
"the product of spin of particle 1 along x\, spin of parti- 
cle 2 along X2, ■■■ and spin of particle N along xn is +1", 
and the other one " the product of spin of particle 1 along 



4 Here a very subtle position was assumed by von Weizsacker 
(1975) who wrote: "It is most important to see that this 
[the fact that probability is not a prediction of the precise 
value of the relative frequency] is not a particular weakness 
of the objective empirical use of the concept of probability, 
but a feature of the objective empirical use of any quantita- 
tive concept. If you predict that some physical quantity, say 
a temperature, will have a certain value when measured, this 
prediction also means its expection value within a statistical 
ensemble of measurements. The same statement applies to 
the empirical quantity called relative frequency. But here are 
two differences which are connected to each other. The first 
difference: In other empirical quantities the dispersion of the 
distribution is in most cases an independent empirical prop- 
erty of the distribution and can be altered by more precise 
measurements of other devices; in probability the dispersion 
is derived from the theory itself and depends on the absolute 
number of cases. The second difference: In other empirical 
quantities the discussion of their statistical distributions is 
done by another theory than the one to which they individ- 
ually belong, namely by the general theory of probability; in 
probability this discussion evidently belongs to the theory of 
this quantity, namely of probability itself. The second differ- 
ence explains the first one." 



x\, spin of particle 2 along X2, ■■■ and spin of particle N 
along xn is -1" , so that Eq. (8) can be applied. If we now 
denote the probabilities for the two outcomes by p+ ^ CJY 
and p Xl XN respectively, then the information contained 
in proposition (*) is given by 

Ix 1 ...X N — (Pxi...XN _ Pxi...X N ) ■ (9) 

Now we will express Eq. (9) in terms of the density 
matrix p of N qubits. First note that an arbitrary mixed 
state of N qubits can be written as 

1 3 

P=^n E r x,.. IN <®...®^ K (10) 

Xl,...,XAT=0 

where (Tq is the identity operator in the Hilbcrt space of 
particle j, and u x . is a Pauli operator for Xj = 1,2,3. 
Here the elements of the correlation tensor T are given 
as mean values of the product of the N spins, 

T X1 ... XN =Tr[p(^ i ® ...Oafj] =p+... xw -p~... XN 

(11) 

with T0...0 = 1. Then obviously our measure of infor- 
mation (9) is equal to the square of the corresponding 
element of the correlation tensor 

Ixi...x N — T Xl XN . (12) 

We have thus obtained a quantitative expression for 
the individual measures of information contained in the 
sum of Eq. (7) and we want to emphasize that our anal- 
ysis of entanglement would not be possible without the 
use of the measure of information (8). 

So far in the present paper we followed Schrodinger's 
concepts of entangled states in our information-theoretic 
analysis. Now we will follow Bell's ideas in a second, 
independent approach to characterize entanglement. 

V. ALL BELL INEQUALITIES FOR CORRELATIONS 

Here we obtain a single general Bell inequality that 
summarizes all possible constraints on the statistical cor- 
relations of an N-qubit system. These constraints are de- 
rived under the assumptions of local realism. We consider 
such correlation measurements where for each individual 
particle one of the two arbitrary dichotomic observables 
can be chosen. From this inequality we obtain as spe- 
cific corollaries the Clauser-Horne-Shimony-Holt (CHSH) 
inequality (Clauser et at, 1969) for two-qubit systems 
and the related inequalities for N qubits (Mermin, 1990; 
Ardehali, 1992; Belinskii and Klyshko, 1993). 

In a local realistic picture one assumes that the result 
of every measurement of an observable is predetermined. 
Thus in such a picture one implicitly requires an unlim- 
ited amount of information to be carried by an individual 
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particle, which conflicts with Schrodinger's idea (a). Take 
therefore an individual observer and allow him or her to 
be able to choose between two dichotomic observables 
(determined by some parameters denoted here Hi and 
H2) ■ This implies the existence of two numbers Aj{n\) 
and Aj(n,2) each taking values +1 or -1 which describe 
the predetermined result of a measurement by the ob- 
server of the observable defined by the local parameter 
n\ and H2, respectively. We choose such a notation for 
brevity; of course each observer can choose independently 
two arbitrary directions. 

In a specific run of the experiment the correlations be- 
tween all N observations can be represented by the prod- 
uct YijLi Ajifikj), with kj = 1,2. The correlation func- 
tion is then the average over many runs of the experiment 

E(n kl ,...,n kN ) = ^A j (n kj )\ . (13) 

« = 1 ' avg 

Note that for each observer j one has either \Aj{fii) + 
Aj(n,2) \ = and \Aj{n\) — A,(n 2 )| = 2 or the other way 
around. Then for all sign sequences of Si,...,Sn, where 
Sj € { — 1, 1} the modulus of the product | IT^Li [A? (™i) + 
SjAj(fi2)]\ vanishes except just one for which the product 
is 2 N . Therefore one has 

E \f[[s j A j (ni) + s 2 j A j (n 2 )]\=2 N . (14) 

=-1,1 J 

It then follows directly that the correlation functions 
must satisfy the following general Bell inequality (Wein- 
furter and Zukowski, 2001; Zukowski and Brukner, 2001; 
for an independent derivation see Werner and Wolf, 2001) 

E I E s1\..s k N » E(n kl ,...,n kN )\<2 N . (15) 

S 1.---. S JV fc 1 ,...,fc JV 
= -1,1 =1,2 

Therefore within each modulus we have sums of all 2 N 
correlation functions, which are the result of the Bell-type 
experiment, however each correlation functions is multi- 
plied by a specific sign ±. Using the generalized trian- 
gle inequality we obtain the set of Bell-type inequalities 
which are equivalent to inequality (15) 

I ES(si,-,sjv) E s k 1 \..s k N N E(n kl ,...,n kN )\<2 N , 

s l, - - ■ > s N &i , - ■ • > fc jV 

= -1,1 =1,2 

(16) 

where S{s\, Sjv) is one of the 2 2 sign functions of 
Si,...,Sn, by which we mean that its possible values can 
only be +1 or -1. Actually, inequalities (16) represent 
the complete set 5 of all possible 2 2N inequalities for the 



5 For an extensive classification of the inequalities see Werner 



correlations, one for each possible choice of the sign func- 
tion S. Many of these inequalities are trivial (for example 
when the choice is S(si, sn) — 1 for all arguments, we 
get that the modulus of the correlation function does not 
exceed 1). 

An exemplary stringent condition result 
for S(s u sn) = V2cos(-f + ( Sl + ... + s N - JV)f ) 
which leads to the series of inequalities derived by Be- 
linskii and Klyshko (1993). Specifically, for N = 2, the 
CHSH inequality 

\E(1, 1) + E(l, 2) + E(2, 1) - E(2, 2)| < 2 (17) 

follows. For N = 3, one obtains 

\E(1, 2, 2) + E(2, 1, 2) + E(2, 2, 1) - E(l, 1, 1)| < 2, (18) 

where here we use numbers 1 and 2 to denote directions 
ni and n 2 , respectively. Inequality (18) leads to the 
Greenberger-Horne-Zcilinger (Greenberger et al. 1989; 
1990) contradiction for an appropriate choice of local set- 
tings. In these cases the left hand side of inequality (18) 
reaches the value 4 which is the maximum possible value 
for any, not only quantum, correlation function. 

Thus far we have shown that when a local realistic 
model applies, the general Bell inequality (15) follows. 
The reverse is also true and we give the proof below: 
whenever inequality (15) holds one can construct a lo- 
cal realistic model for the correlation function. This 
establishes the general Bell inequality presented above 
as a necessary and sufficient condition for local realis- 
tic description of multi-particle correlations, where two 
dichotomic observables are measured on each individual 
particle. 

The proof of the sufficiency of condition (15) 
will be done here in a constructive way. Simply 
one ascribes to the set of predetermined local re- 
sults, which satisfy the following conditions Aj(n{) = 
SjAj(fi2), the hidden probability p(s\, sjv) = 
w\^k lt ...,k N s i 1 -~ sk N E (nk 1 , ■-,n kN )\, and one demands 
that the product YljLi Aj(n 2 ) has the same sign as that 
of the expression inside of the modulus defining the 
p(si, sn). In this way a definite set of local realis- 
tic values is ascribed a unique global hidden probability. 
Obviously, such defined probabilities are positive. How- 
ever due to inequality (15) they may add up to less than 
1. In such a case, the "missing" probability is ascribed 
to an arbitrary model of local realistic noise (e.g., for 
which all possible products of local results enter with 
equal weights) . The overall contribution of such a noise 



and Wolf (2001). For three qubits a complete set of in- 
equalities has been found numerically by Pitowsky and Svozil 
(2000). See also Pitowsky (1989) and Peres (1999). 
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term to the correlation function is zero. In this way we 
obtain a local realistic model of a certain correlation func- 
tion. However, one should check that this construction 
indeed reproduces the model for the correlation function 
for the set of settings that enter inequality (15), that is for 
E(n kl , ...,n kN ). For simplicity take N = 2. Notice that 
the expansion coefficients of the four-dimensional vec- 
tor (E(ni, ni), E(ni, ft?), E(n2, ni), E(n,2, ^2)) in terms 
of orthogonal basis vectors (s\S2, s\s2, S1S2, s i s 2) ( rc ~ 
call that S\,S2 € { — 1,1}) are equal to the 
expressions within the moduli entering inequality 
(15). Next notice, that by the construction shown 
above the local realistic model for N = 2 gives 
(E LV (ni,ni), E LV (n 1 ,n 2 ),E LV (n 2 , Hi), E LV (n 2l n 2 )) = 
3 E SllS2 E fel ,fe 2 8^a^E(fi kl , n k2 )( Sl s 2 , s\s 2 , Sl sl s\sl). 
Thus, since the vector built out of the correlation func- 
tion values and its local realistic counterpart have the 
same expansion coefficients, they are equal and the suf- 
ficiency of (15) as a condition for local realism is proven. 
The generalization to an arbitrary N is obvious. 

Above we derived the full set of Bell inequalities for 
multi-qubit correlations. This strictly defines the bound- 
ary of the validity of local realism. We will now dis- 
cuss states which violate such inequalities with the spe- 
cific aim of investigating the way information can be dis- 
tributed between the subsystems of such states. 



VI. N QUBITS THAT VIOLATE LOCAL REALISM 

Let us consider the general N-qubit state as in Eq. 
(10). Then the N-qubit quantum correlation function for 
a Bell-GHZ type experiment is 



EQ M {n kl ,...,n kN ) = Tr[p(<7 • ft kl 



cr -n k . 



(19) 



= E T xi-x N (n kl ) Xl ...(n kN ) XN (20) 

xi....,x n — 1 

where {n kj ) Xj (xj = 1,2,3) are the three Cartesian com- 
ponents of the vector n kj . Equation (19) means that 
the N-particle correlation function is fully defined by 
a tensor T (the indices of which can take values 1,2,3, 
and which belongs to R 3N ). For convenience we shall 
write down the last equation in a more compact way as 
E QM (n kl , ■■■ 1 n kN ) = (T,n kl <g> ... <g> n kN ), where (...,...) 
denotes the scalar product in R 3N . 

The necessary and sufficient condition (15) for a lo- 
cal realistic description of N-particle correlations implies 
that the quantum correlations for N qubits can always 
have a local and realistic model for the Bell-type experi- 
ment if and only if 



E |(T,]T^n fcl 

l,...,»iV fe 1 = i 



E 4 N ^ N )|<2 JV (21) 



for any possible choice n kl , n kN of each observer's two 
local settings Hi and n 2 . 

This condition can be simplified further, provided one 
notices that for each observer there always exist two mu- 
tually orthogonal unit vectors a\ and a 2 , independently 
defined for each observer, and the angle aj such that 
£^.=1%,- = 2aiCOs(a j + f) and £fc. =1 (-l) fe 'n fej = 
2a 2 cos(aj + 7r). Denoting with c Xj = cos(aj + Xj^) one 
can write the inequality (21) as 



E 



. . . c x 



,(f,a x 



< 1. 



(22) 



One can rewrite this inequality as 

E \ c x 1 ~-c XN (f,a Xl ®...®a XN )\<l (23) 



where T T 



is now a component of the tensor T in a 



new set of local coordinate systems, which among their 
basis vectors have a\ and a 2 which serve as the unite 
vectors which define the directions x and y. 

The necessary and sufficient condition for impossibility 
of any local realistic description of N-qubit correlations is 
that the maximum of the left-hand side of inequality (23) 
is larger than one. Once the values of the elements of the 
correlation tensor are given for the specific density matrix 
one can check via maximization procedure whether the 
local realistic description is possible. 

On the other hand, it is easy to notice that with the aid 
of the Cauchy-Schwarz inequality, one has an inequality 



\ c x 1 ---c Xn T Xi ... Xn \ < 



V T 2 



(24) 



Combining inequality (23) with (24) we obtain that a 
sufficient condition for the possibility of a local realistic 
description of the quantum N-qubit correlations in any 
Bell-type experiment is that 



E 



< 1 



(25) 



k N = l 



for all possible choices of local coordinate systems for N 
qubits as then the full set of Bell inequalities (16) is sat- 
isfied. Equivalcntly, if at least one of the Bell inequalities 
from the set is violated then condition (25) is violated for 
at least one choice of local coordinate systems. 

Our measure of information (12) is exactly equal to the 
square of the corresponding element of the quantum cor- 
relation matrix. This establishes the equivalence between 
the condition for local realism (25) on correlation tensor 
elements and our information-theoretical criterion (7) for 
information contained in correlations by states which do 
not reveal quantum entanglement. If the state violates at 
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least one of the Bell inequalities from the full set (16) then 
this state is characterized by the information-theoretical 
criterion (5). 

By performing rotations in the x-y planes of the N 
observers one can vary the values of the elements of the 
correlation tensor, but these variations do not change the 
left-hand side of inequality (25). In information-theoretic 
language we say that the total information content in x- 
y plane correlations is invariant under these variations. 
The invariance property implies that one can find local 
coordinate systems for which some of the correlation ten- 
sor elements vanish thus having criterion (25) which in- 
volves a smaller number of them (For the case of three 
qubits see Scarani and Gisin, 2001). For example, in 
the two-qubit case the rotations in the x-y planes of the 
two observers are obtained with the use of two param- 
eters, each describing the rotation angle for the given 
local observer, and therefore one can always find local 
coordinate systems such that two of correlation tensor 
elements vanish (T xy = T yx = 0) (Sec Horodecki and 
Horodccki, 1996). Then it can easily be seen that vary- 
ing the two angles a\ and «2 the expression on the left 
hand side of inequality (24) can be saturated by the one 
on the right hand side. Finally this establishes the con- 
dition (25) for two qubits as the necessary and sufficient 
condition for the correlations measured on an arbitrary 
two-qubit mixed state to be understood within the local 
realistic picture. In that case our necessary and sufficient 
condition I xx + I yy > 1 (or equivalently T xx +T yy > 1) for 
violation of the most general Bell's inequality is equiva- 
lent with the necessarily and sufficient condition for vio- 
lation of the CHSH inequality which was obtained by the 
Horodeckis (1995). Thus, our result also confirms that 
non-violation of the CHSH inequality is a necessary and 
sufficient condition for the local realistic description of 
two-qubit correlations. 

We will now analyze from our information theoretic 
point the case of an N-qubit Bell- type experiment. We 
are specifically interested in the limit up to which the 
experiment still has a local realistic interpretation. Con- 
sider the state which is a mixture of the maximally en- 
tangled state and the noise induced by experimental im- 
perfections. Such a state is known as the Werner state 
and has the form 

p W = V\lp GH z}(lpGHz\ + (1 - V)p noise (26) 

where \^ghz) = ^(| + z)\ — \ + z) N + 1 - z)i — \ ~ z)n) is 
the maximal entangled (GHZ) state and p no ise — jn! is 
the completely mixed state. Here e.g., | +z)j denotes the 
spin up of particle j along z. We would like to emphasize 
that the weight V of the GHZ-state can operationally be 
interpreted as the visibility observed in a multi-particle 
interference experiment (Belinskii and Klyshko, 1993). 

For the purposes of our argument we will now calcu- 
late the number of non-zero correlation tensor elements 



(which are related to our individual measures of informa- 
tion contained in the correlations) for the Werner state. 
Note first that for any measurement direction ft belong- 
ing to the x-y plane the spin component n ■ S has its 
eigenvectors in the form | ± n) = -^(| + z) ± e l ^\ — z)), 
where <j) is the azimuthal angle of the vector n. Using Eq. 
(19) one can easily show that the correlation function for 
arbitrary chosen measurement directions within the local 
x-y plains is 

N 

E w (cp 1 ,...,<p N ) = Vcos(J2<l>k)- (27) 
fe=i 

This implies that the correlation tensor elements T Xl .„ XN 
with xi,...,x^ each being either x or y are given by 
T X1 . ..xn — V cos(m y ^), where m y is the number of y's 
in {xi, ...,x N }. Therefore, for each N one always has 
T xx .. x — V. The other components are zero except 
for those that contain an even number of y's, which 
are either V or -V. This results in the total number 
1 + Sfe^i (2I) — 2 JV ~ 1 of non-zero components for the 
even N and 1 + £^ 1)/2 Q = 2 N ~ 1 for the odd N. 

We would like to stress again the equivalence I Xl ... XN — 
T~, x „ between the measure of information contained in 
the correlation between measurements performed along 
directions x\,...,Xn and the square of the correspond- 
ing correlation tensor element. In the case of N qubits 
in the Werner state we therefore have 2 JV ~ 1 individual 
measures of information with the value V 2 and we have 
the remaining ones equal to zero. Inserting these values 
into the information criterion (7) (or equivalently (25)) 
one obtains 

Vi^) (28) 

for the maximal visibility which still allows the correla- 
tions between N qubits in the Werner state to be un- 
derstood within a local realistic picture. Note that in 
such a case the right-hand side of the inequality (24) 
is one. Since the value given on the right-hand side of 
(28) is also the minimal visibility necessarily to violate 
the inequalities for N qubits obtained by Belinskii and 
Klyshko (1993) and since they are included in our set of 
all possible inequalities (16) we therefore can conclude 
that our information criterion (7) is the necessary and 
sufficient condition for correlations between N qubits in 
the Werner state to violate the local realistic description. 

It is interesting to note that different inequalities can 
be obtained for correlations to be understood within lo- 
cal realism where not only two but three (Zukowski and 
Kaszlikowski, 1997) or even all possible measurement set- 
tings (Zukowski, 1993) are chosen by N observers. There 
even lower thresholds for the visibilities were obtained to 
violate the local realistic description. For such an exper- 
imental situation our criterion (7) is the sufficient condi- 
tion for violation of the inequalities. 
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Another interesting observation is that for N maxi- 
mally entangled qubits N bits of information rest in the 
correlations, as opposed to always not more than one bit 
for the classically composed ones. In the case of GHZ (for 
V = 1 in the consideration given above) the information 
criterion (7) results in 2 N ~ 1 < 1 which clearly shows 
that with growing N the discrepancy between quantum 
and classical correlations grows exponentially. This is 
in concurrence with the fact that the GHZ theorem is 
stronger than Bell's and its strength, as measured by the 
magnitude of violation of (16) for maximally entangled 
states, exponentially increases with the number of qubits 
(Mermin, 1990; Ardehali, 1992, Belinskii and Klyshko, 
1993). 

VII. CONCLUSIONS 

We now would like to review what we have done in 
the present paper and put it in a broad perspective. The 
paper contains two independent main approaches to the 
question of quantum entanglement, and we finally show 
their essential equivalence. 

In the first approach we start from the conceptual po- 
sition that quantum mechanics is about information. We 
express the information contained in composite systems 
such that it can be divided into the information carried 
by the individuals versus the information contained in the 
correlations between observations made on the individu- 
als. We further assume that the information contained 
in any system, be it individual or composite, is finite. 

Considering first the classically composed systems we 
note that any correlations we might observe between the 
subsystems of such a composite system can simply be un- 
derstood on the basis of correlations between the prop- 
erties the individual subsystems have on their own. This 
means that if we know all properties of the individual 
subsystems we can definitely conclude how much infor- 
mation is contained in their correlations. For the quan- 
tum entangled systems this is not true anymore. Such 
composite quantum systems can carry more information 
in joint properties than what may be concluded from 
knowledge of the individuals. These considerations lead 
to a natural information-based understanding of quan- 
tum entanglement. Within this view we see Mermin's 
(1998) "correlations without correlata" as reflecting that 
when correlations are defined there is no information left 
to define "correlata" as well. "Correlations have physical 
meaning; that which they correlate does not," as stated 
by Mermin (1998). 

In an independent approach we obtain the most gen- 
eral set of Bell inequalities for N qubits. That way we ar- 
rive at a necessary and sufficient condition for quantum 
states whose correlations cannot be understood within 
local realism. Local realism is based on the assump- 
tion that results of the observations on the individual 



systems arc predetermined and independent of whatever 
measurements might be performed distantly. One may 
notice that this assumption implicitly says that correla- 
tions between subsystems do not go beyond what might 
be concluded from the properties of individual subsys- 
tems. 

We finally show that the two approaches, the informa- 
tion theoretical one and the one via Bell's inequalities, 
are equivalent in their essence. This is done via the fact 
that the Bell inequality criteria can be translated into a 
statement about correlations (probabilities), which again 
can be understood as an information theoretical expres- 
sion. This requires the use of a new measure of infor- 
mation introduced earlier (Brukner and Zeilinger, 1999). 
This measure of information is distinct from Shannon's 
measure (Shannon, 1948). The main conceptual differ- 
ence is that Shannon's measure tacitly assumes that the 
properties of the systems carrying the information arc 
already well defined prior to, and independent of, obser- 
vation (Brukner and Zeilinger 2001). In quantum me- 
chanics this clearly is not the case. There, the criterion 
for choosing the new measure of information was that 
it is invariant on the experimentalist's free choice of a 
complete set of mutually complementary observables. 

Summing up, we would like to draw the reader's atten- 
tion to the fact of the equivalence of the two approaches 
in the present paper, the information theoretic one and 
the one via Bell's inequalities. It is evident that the first 
one is both conceptually and formally much simpler. It is 
suggestive that this new information theoretic formula- 
tion of quantum phenomena opens up the avenue of new 
approaches to well known problems in quantum informa- 
tion physics and in the foundations of quantum mechan- 
ics. 
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